Text Gestalt: Stroke-Aware Scene Text Image Super-resolution
نویسندگان
چکیده
In the last decade, blossom of deep learning has witnessed rapid development scene text recognition. However, recognition low-resolution images remains a challenge. Even though some super-resolution methods have been proposed to tackle this problem, they usually treat as general while ignoring fact that visual quality strokes (the atomic unit text) plays an essential role for According Gestalt Psychology, humans are capable composing parts details into most similar objects guided by prior knowledge. Likewise, when observe image, will inherently use partial stroke-level recover appearance holistic characters. Inspired we put forward Stroke-Aware Scene Text Image Super-Resolution method containing Stroke-Focused Module (SFM) concentrate on internal structures characters in images. Specifically, attempt design rules decomposing English and digits at stroke-level, then pre-train recognizer provide attention maps positional clues with purpose controlling consistency between generated image high-resolution ground truth. The extensive experimental results validate can indeed generate more distinguishable TextZoom manually constructed Chinese character dataset Degraded-IC13. Furthermore, since SFM is only used guidance training, it not bring any time overhead during test phase. Code available https://github.com/FudanVI/FudanOCR/tree/main/text-gestalt.
منابع مشابه
Super-Resolution Enhancement of Text Image Sequences
Abstract The objective of this work is the super-resolution enhancement of image sequences. We consider in particular images of scenes for which the point-to-point image transformation is a plane projective transformation. We first describe the imaging model, and a maximum likelihood (ML) estimator of the super-resolution image. We demonstrate the extreme noise sensitivity of the unconstrained ...
متن کاملCorpus based coreference resolution for Farsi text
"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...
متن کاملText Super-Resolution: A Bayesian Approach
We address the problem of text super-resolution: given an image of text scanned in at low resolution from a piece of paper, return the image that is mortly likely to be generated from a noiseless high-resolution scan of the same piece of paper. In doing so, we wish to: (1) avoid introducing artifacts in the high-resolution image such as blurry edges and rounded corners, (2) recover from quantiz...
متن کاملAn Introduction to Super-Resolution Text
This chapter examines the field of super-resolution with application to text analysis. While the area of super-resolution has been dealt with in fair depth in recent years, it is only just becoming useful as an applicable stage in improving text images, particularly for further processing, transmission, and understanding on mobile and handheld devices. After dealing with the general concepts of...
متن کاملSuper-resolution Text using the Teager Filter
We propose a super-resolution technique specifically aimed at enhancing low-resolution text images from handheld devices. The Teager filter, a quadratic unsharp masking filter, is used to highlight high frequencies which are then combined with the warped and interpolated image sequence following motion estimation using Taylor series decomposition. Comparative performance evaluation is presented...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i1.19904